On the Complexity of Grammar-Based Compression over Fixed Alphabets

Authors

  • Katrin Casel
  • Henning Fernau
  • Serge Gaspers
  • Benjamin Gras
  • Markus L. Schmid
Abstract

It is shown that the shortest-grammar problem remains NP-complete if the alphabet is fixed and has a size of at least 24 (which settles an open question). On the other hand, this problem can be solved in polynomial time if the number of nonterminals is bounded, which is shown by encoding the problem as a problem on graphs with interval structure. Furthermore, we present an O(3^n) exact exponential-time algorithm, based on dynamic programming. Similar results are also given for 1-level grammars, i.e., grammars for which only the start rule contains nonterminals on the right side (thus investigating the impact of the "hierarchical depth" on the complexity of the shortest-grammar problem).

1998 ACM Subject Classification: F.2.2 Nonnumerical Algorithms and Problems, E.4 Coding and Information Theory
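As a rough illustration of the objects the abstract refers to, the sketch below represents a grammar that generates exactly one word, derives that word, and checks the 1-level property. It is not the paper's algorithm. The dictionary representation, the function names, and the size measure (total length of all right sides) are assumptions made for this example.

```python
# Illustrative sketch only: representation, names, and size measure are assumptions.
from typing import Dict, List

Grammar = Dict[str, List[str]]  # nonterminal -> right side (list of symbols)


def expand(grammar: Grammar, symbol: str = "S") -> str:
    """Derive the single word generated from `symbol` (grammar must be acyclic)."""
    if symbol not in grammar:  # any symbol without a rule is treated as a terminal
        return symbol
    return "".join(expand(grammar, s) for s in grammar[symbol])


def size(grammar: Grammar) -> int:
    """Assumed size measure: total length of all right sides."""
    return sum(len(rhs) for rhs in grammar.values())


def is_one_level(grammar: Grammar, start: str = "S") -> bool:
    """True if only the start rule has nonterminals on its right side."""
    return all(
        s not in grammar
        for name, rhs in grammar.items()
        if name != start
        for s in rhs
    )


word = "abcabcabcabc"
trivial = {"S": list(word)}                                   # no compression
one_level = {"S": ["A", "A", "A", "A"], "A": ["a", "b", "c"]}
multi_level = {"S": ["B", "B"], "B": ["A", "A"], "A": ["a", "b", "c"]}

for g in (trivial, one_level, multi_level):
    print(expand(g) == word, size(g), is_one_level(g))
```

All three grammars derive the same word. Under the assumed size measure, the shortest-grammar problem asks for a grammar of minimum size among all those that generate the given word.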

Similar resources

The Smallest Grammar Problem Revisited

In a seminal paper of Charikar et al. on the smallest grammar problem, the authors derive upper and lower bounds on the approximation ratios for several grammar-based compressors, but in all cases there is a gap between the lower and upper bound. Here we close the gaps for LZ78 and BISECTION by showing that the approximation ratio of LZ78 is Θ((n/log n)^{2/3}), whereas the approximation ratio of BISE...

Descriptional complexity measures of context-free languages

In [2], [3] and [4] several measures of descriptional complexity of context-free grammars (cfg's) and context-free languages (cfl's) have been investigated, most of them having the following properties: 1. The corresponding hierarchy of complexity classes of languages over two-letter alphabets is infinite. 2. The basic algorithmic problems are undecidable. (For example, the problems to determin...

Block-Based Compressive Sensing Using Soft Thresholding of Adaptive Transform Coefficients

Compressive sampling (CS) is a new technique for simultaneous sampling and compression of signals in which the sampling rate can be very small under certain conditions. Due to the limited number of samples, image reconstruction based on CS samples is a challenging task. Most of the existing CS image reconstruction methods have a high computational complexity as they are applied on the entire im...

Grammar-based codes: A new class of universal lossless source codes

We investigate a type of lossless source code called a grammar-based code, which, in response to any input data string x over a fixed finite alphabet, selects a context-free grammar G_x representing x in the sense that x is the unique string belonging to the language generated by G_x. Lossless compression of x takes place indirectly via compression of the production rules of the grammar G_x. It is shown that, su...

Comparative Impacts of Mindsettings on EFL Learners' Grammar Achievement

The present study was conducted to investigate the comparative impacts of three types of EFL teachers' mindsettings on EFL learners' grammar achievement. The participants of the study were English Translation undergraduate students (both female and male, with ages ranging from 18 to 35) who were selected according to convenience non-random sampling from three classes of English Grammar 1 at both...

Publication date: 2016